Inner product computation for sparse iterative solvers on distributed supercomputer

Author: Gu T.-X.
Liu X.-P.
Zhu S.-X.
Publication date: 01/01/2012

In recent years it has become clear that iterative Krylov methods, unless re-designed, are poorly suited to distributed supercomputers because of their intensive global communication. It is well accepted that re-engineering Krylov methods for a prescribed computer architecture is necessary and important to achieve higher performance and scalability. The paper focuses on simple and practical ways to re-organize Krylov methods and improve their performance on current heterogeneous distributed supercomputers. In contrast with most current software development for Krylov methods, which usually focuses on efficient matrix-vector multiplication, the paper focuses on how to compute inner products on supercomputers and explains why inner product computation on current heterogeneous distributed supercomputers is crucial for scalable Krylov methods. Communication complexity analysis shows how inner product computation can become the performance bottleneck of (inner) product-type iterative solvers on distributed supercomputers because of global communication. Principles for reducing such global communication are discussed. The importance of minimizing communication is demonstrated by experiments using up to 900 processors. The experiments were carried out on a Dawning 5000A, one of the fastest and earliest heterogeneous supercomputers in the world. Both the analysis and the experiments indicate that inner product computation is very likely to be the most challenging kernel for inner product-based iterative solvers to achieve exascale.
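
The point about global communication can be illustrated with a small simulation. The sketch below is not taken from the paper: it assumes a toy `SimulatedComm` class (a hypothetical stand-in for an all-reduce, not a real MPI API) and shows how several inner products might be packed into one global reduction instead of one reduction each.

```python
import numpy as np


class SimulatedComm:
    """Toy stand-in for a global all-reduce; it only counts reductions."""

    def __init__(self):
        self.reductions = 0

    def allreduce_sum(self, per_process_values):
        # One call models one global synchronization across all processes.
        self.reductions += 1
        return np.sum(per_process_values, axis=0)


def local_chunks(v, p):
    """Split a vector into p contiguous chunks, one per simulated process."""
    return np.array_split(v, p)


def dots_separate(comm, pairs, p):
    """Compute each inner product with its own global reduction."""
    results = []
    for x, y in pairs:
        partial = [np.dot(xc, yc)
                   for xc, yc in zip(local_chunks(x, p), local_chunks(y, p))]
        results.append(comm.allreduce_sum(np.array(partial)))
    return np.array(results)


def dots_batched(comm, pairs, p):
    """Pack all inner products into a single global reduction."""
    chunked = [(local_chunks(x, p), local_chunks(y, p)) for x, y in pairs]
    # Row i holds the partial sums that simulated process i would contribute.
    local = np.array([[np.dot(xs[i], ys[i]) for xs, ys in chunked]
                      for i in range(p)])
    return comm.allreduce_sum(local)
```

Both functions return the same values; only the number of simulated synchronizations differs, which is the quantity the abstract identifies as the scalability bottleneck.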

Oxford University Research Archive

Minimizing synchronizations in sparse iterative solvers for distributed supercomputers

Author: Gu T.-X.
Liu X.-P.
Zhu S.-X.
Publication date: 01/01/2013

Eliminating synchronizations is one of the important techniques for minimizing communication in modern high performance computing. This paper discusses principles for reducing the communication caused by global synchronizations in sparse iterative solvers on distributed supercomputers. We demonstrate how to minimize global synchronizations by rescheduling a typical Krylov subspace method. The benefit of minimizing synchronizations is shown by theoretical analysis and verified by numerical experiments using up to 900 processors. The experiments also show that the communication complexity of some structured sparse matrix-vector multiplications and of global communications on the underlying supercomputer is of order $P^{1/2.5}$ and $P^{4/5}$ respectively, where $P$ is the number of processors. The experiments were carried out on a Dawning 5000A.
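
As an illustration of what rescheduling a Krylov method can look like, here is a minimal sketch of the Chronopoulos-Gear variant of conjugate gradients. This is a well-known rearrangement chosen for illustration, not necessarily the method rescheduled in the paper. The function name `cg_single_sync` and its parameters are hypothetical. The two inner products of each iteration are computed back to back, so a distributed code could fuse them into one global reduction.

```python
import numpy as np


def cg_single_sync(A, b, tol=1e-10, max_iter=500):
    """Chronopoulos-Gear style CG for a symmetric positive definite A.

    Returns the approximate solution and the number of iterations used.
    """
    x = np.zeros_like(b, dtype=float)
    r = b - A @ x
    w = A @ r
    gamma = r @ r   # these two dot products are adjacent, so they
    delta = w @ r   # could share a single global reduction
    if np.sqrt(gamma) <= tol:
        return x, 0
    alpha = gamma / delta
    beta = 0.0
    p = np.zeros_like(r)
    s = np.zeros_like(r)
    for k in range(1, max_iter + 1):
        p = r + beta * p
        s = w + beta * s          # s tracks A @ p through a recurrence
        x = x + alpha * p
        r = r - alpha * s
        w = A @ r
        gamma_new = r @ r         # the only synchronization point
        delta = w @ r             # of the iteration
        if np.sqrt(gamma_new) <= tol:
            return x, k
        beta = gamma_new / gamma
        alpha = gamma_new / (delta - beta * gamma_new / alpha)
        gamma = gamma_new
    return x, max_iter
```

Textbook CG needs two separate reductions per iteration because the second inner product depends on a vector updated with the result of the first; the recurrence for `alpha` above removes that dependency at the cost of one extra vector update.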

Oxford University Research Archive

On the spectral distribution of kernel matrices related to radial basis functions

Author: Wathen A. J.
Zhu S.-X.
Publication date: 01/01/2012

This paper focuses on the spectral distribution of kernel matrices related to radial basis functions. The asymptotic behaviour of the eigenvalues of kernel matrices related to radial basis functions with different smoothness is studied. These results are obtained by estimating the coefficients of an orthogonal expansion of the underlying kernel function. Besides many other results, we prove that there are exactly $\binom{k+d-1}{d-1}$ eigenvalues of the same order for analytic separable kernel functions like the Gaussian in $\mathbb{R}^d$. This gives theoretical support for how to choose the diagonal scaling matrix in the RBF-QR method (Fornberg et al., SIAM J. Sci. Comput. (33), 2011), which can stably compute Gaussian radial basis function interpolants.
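
The count $\binom{k+d-1}{d-1}$ is also the number of monomials of total degree exactly $k$ in $d$ variables, a standard combinatorial identity. Reading the eigenvalue groups this way is an interpretation, not a statement from the abstract. The short sketch below, with hypothetical helper names, checks the identity by brute-force enumeration.

```python
from itertools import product
from math import comb


def monomial_exponents(k, d):
    """All exponent tuples (e_1, ..., e_d) of non-negative integers summing to k."""
    return [e for e in product(range(k + 1), repeat=d) if sum(e) == k]


def group_size(k, d):
    """Closed-form count binom(k + d - 1, d - 1) quoted in the abstract."""
    return comb(k + d - 1, d - 1)
```

For example, in two dimensions the group sizes for k = 0, 1, 2, 3 are 1, 2, 3, 4.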

Oxford University Research Archive

Recommended from our members

How Service-Centric Systems Change the Requirements Process

Author: Jones S.
Maiden N.
Zachos K.
Zhu X.
Publication date: 01/01/2005

City Research Online

Perturbation theory of von Neumann Entropy

Author: Chen X Y
Chen Xiao-Yu
Gao Y F
Holevo A S
Reed M
Zhu X
Publication venue: IOP Publishing
Publication date: 26/02/2009

In quantum information theory, von Neumann entropy plays an important role. The entropy can be obtained analytically only for a few states. In continuous-variable systems, even evaluating the entropy numerically is not an easy task, since the dimension is infinite. We systematically develop perturbation theory for calculating the von Neumann entropy of non-degenerate systems as well as degenerate systems. The result turns out to be a practical way to calculate the von Neumann entropy by expansion. Comment: 7 pages
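
The abstract gives no formulas, so the following is only a sketch under stated assumptions, not the paper's expansion. It assumes a finite-dimensional, full-rank density matrix `rho0` and a traceless Hermitian perturbation `rho1`, and uses the standard first-order relation $S(\rho_0 + \epsilon\rho_1) \approx S(\rho_0) - \epsilon\,\mathrm{Tr}(\rho_1 \ln \rho_0)$. The function names are hypothetical.

```python
import numpy as np


def von_neumann_entropy(rho):
    """S(rho) = -Tr(rho ln rho), computed from the eigenvalues of rho."""
    lam = np.linalg.eigvalsh(rho)
    lam = lam[lam > 1e-15]  # treat 0 * ln 0 as 0
    return float(-np.sum(lam * np.log(lam)))


def entropy_first_order(rho0, rho1, eps):
    """First-order estimate of S(rho0 + eps * rho1).

    Assumes rho0 is full rank and rho1 is Hermitian with zero trace.
    """
    lam, vecs = np.linalg.eigh(rho0)
    # Matrix logarithm of rho0 via its eigendecomposition.
    log_rho0 = (vecs * np.log(lam)) @ vecs.conj().T
    correction = float(np.real(np.trace(rho1 @ log_rho0)))
    return von_neumann_entropy(rho0) - eps * correction
```

For a small `eps`, the estimate can be compared against `von_neumann_entropy(rho0 + eps * rho1)`; the gap should shrink roughly quadratically in `eps`.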

arXiv.org e-Print Archive

Crossref

Bioinspired Multifunctional Anti-icing Hydrogel

Author: He X
He Z
Hua M
Wang J
Wu C
Wu D
Wu S
Zhu X
Publication venue: eScholarship, University of California
Publication date: 04/03/2020

State-of-the-art anti-icing strategies have mainly focused on three aspects: inhibiting ice nucleation, preventing ice propagation, and decreasing ice adhesion strength. However, it has proved difficult to prevent ice nucleation and propagation while simultaneously decreasing adhesion, because their design principles are highly distinct and even contradictory. In nature, anti-freeze proteins (AFPs) offer a prime example of multifunctional integrated anti-icing materials that excel in all three key aspects of the anti-icing process simultaneously by tuning the structures and dynamics of interfacial water. Here, inspired by biological AFPs, we successfully created a multifunctional anti-icing material based on polydimethylsiloxane-grafted polyelectrolyte hydrogel that can tackle all three aspects of the anti-icing process simultaneously. The simplicity, mechanical durability, and versatility of these smooth hydrogel surfaces make them a promising option for a wide range of anti-icing applications.

eScholarship - University of California